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DOCUMENT CREATION THROUGH EMBEDDED SPEECH RECOGNITION 



FIELD OF THE INVENTION 
[0001] This invention relates in general to voice recognition technology and, more 
particularly, to voice recognition technology in a printing device. 

BACKGROUND OF THE INVENTION 
[0002] Often there is a need to transcribe spoken words to printed text. 
Traditionally, transcription has been accomplished by transcriptionists who listen to 
spoken words and transcribe the spoken words to readable documents. The spoken 
words are either spoken concurrently with the transcription or previously recorded 
and played back for the transcriptionist. Manual transcription by a transcriptionist Is 
time consuming, costly, and error prone. 

[0003] Speech recognition software has been developed for recognizing words of 
human speech. In one conventional application of the speech recognition software, 
words are recognized and reproduced as electronic text in a computer word 
processor. The electronic text is then available for editing, manipulation, and printing 
using the functions of the word processor. While convenient, this application of 
speech recognition software requires a user to manually intervene in order to obtain 
a printed record of the electronic text. 

SUMMARY OF THE INVENTION 
[0004] According to principles of the present invention, audio data is captured, 
then filtered to extract text data and print commands. The text data is converted to 
electronic text. The electronic text is accumulated in a text buffer. Audio data is 
captured and filtered until a print command is extracted from the audio data. All of 
the extracted text data is converted to electronic text, which is accumulated in the 
text buffer. When a print command is extracted from the audio data, the electronic 
text Is translated into a rendered document. The electronic text is either translated 
alone or with a form template. The rendered document is printed and the text buffer 
is purged. 

[0005] According to further principles of the present invention, the audio data is 
captured by first sensing the audio data. An analog audio data signal is generated in 
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response to the sensed audio data. The analog audio data signal is digitizing into 
captured audio data. 

[0006] According to further principles of the present invention, the captured audio 
data is filtered by first connparing the captured audio data with text data selection 
criteria and print command selection criteria. The captured audio data meeting the 
print command selection criteria is extracted as a print command. The captured 
audio data meeting the text data selection criteria is extracted as text data, 
[0007] According to further principles of the present invention, the text data is 
converted to electronic text by inputting the text data into a speech recognition 
module and receiving electronic text from the speech recognition module. 

DESCRIPTION OF THE DRAWINGS 
[0008] Figure 1 is a block diagram representing one embodiment of the system of 
the present invention for printing text. 

[0009] Figure 2 is a block diagram representing an alternate embodiment of the 
system of the present invention for printing text. 

[0010] Figure 3 is a flow chart illustrating one embodiment of the method of the 
present invention for printing text. 

DETAILED DESCRIPTION OF THE INVENTION 
[0011] Illustrated in Figure 1 is a system 2 for printing text. System 2 includes a 
remote audio collector 4 and a printer 6. Remote audio collector 4 Includes an audio 
digitizer 8, an audio filter 10, a speech recognition module 12, a text buffer 14, a 
rendering subsystem 16, a processor 18, a storage system 20, and a communication 
interface 22. Additionally, remote audio collector 4 optionally includes a form 
template 24 and a buffer cleaner 26. 

[0012] Audio digitizer 8 is any combination of hardware and executable code 
configured to capture audio data. Executable code is any code capable of execution 
by a processor or processing system. Examples of executable code include 
software, firmware, and hardwired code. 

[0013] In one embodiment, audio digitizer 8 includes a microphone 28 and an 
analog to digital (A/D) converter 30. Microphone 28 is any electronic device 
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configured to sense audio data and generate an analog audio data signal in 
response to the sensed audio data. A/D converter 30 is any combination of 
hardware and executable code configured to digitize the analog audio data signal 
into captured audio data. In one embodiment, A/D converter 30 is a sound card for a 
computer system. 

[0014] Audio filter 1 0 is any combination of hardware and executable code 
configured to filter captured audio data to extract text data and print commands. In 
one embodiment, audio filter 10 includes a selection criteria database 32, a 
comparator 34, a print command extractor 36, and a text data extractor 38. 
[0015] Selection criteria database 32 is any form of database configured to hold 
print command selection criteria 40 and text data selection criteria 42. Print 
command selection criteria 40 is any criteria configured to select a print command 
from the audio data. Examples of print command selection criteria 40 include 
amplitude, frequency, and duration criteria indicative of a print command. The 
amplitude, frequency, and duration criteria may be user configurable to suit a user. 
[0016] Text data selection criteria 42 is any criteria for selecting a text data from 
the audio data. Examples of text data selection criteria 42 include amplitude, 
frequency, and duration criteria indicative of a text data. The amplitude, frequency, 
and duration criteria may be user configurable to suit a user. 
[0017] Comparator 34 is any combination of hardware and executable code 
configured to compare captured audio data with print command selection criteria 40 
and text data selection criteria 42. In one embodiment, comparator 34 is executable 
code performed by processor 18 to compare captured audio data with print 
command selection criteria 40 and text data selection criteria 42. 
[0018] Print command extractor 36 is any combination of hardware and 
executable code configured to extract, as a print command, captured audio data 
meeting the print command selection criteria 40. In one embodiment, print 
command extractor 36 is executable code performed by processor 18 to extract, as a 
print command, captured audio data meeting the print command selection criteria 40. 
[0019] Text data extractor 38 is any combination of hardware and executable 
code configured to extract, as text data, captured audio data meeting the text data 
selection criteria 42. In one embodiment, text data extractor 38 is executable code 
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performed by processor 18 to extract, as text data, captured audio data meeting the 
text data selection criteria 42. 

[0020] Speech recognition module 12 is any combination of hardware and 
executable code configured to convert text data to electronic text. In one 
embodiment, speech recognition module 12 is executable code performed by 
processor 18 to convert text data to electronic text. 

[0021] Text buffer 14 is any combination of hardware and executable code 
configured to accumulate electronic text. In one embodiment, text buffer 14 is a 
portion of storage system 20 designated to accumulate the electronic text. 
[0022] Rendering subsystem 16 is any combination of hardware and executable 
code configured to translate electronic text accumulated in text buffer 14 into a 
rendered document. In one embodiment, rendering subsystem 16 is executable 
code performed by processor 18 to translate electronic text accumulated in text 
buffer 14 into a rendered document. 

[0023] Processor 18 is any device or system configured to process executable 
code. Processor 18 controls remote audio collector 4. 

[0024] Storage system 20 is any system configured to store data or executable 
code. Storage system 20 may also be a program storage system tangibly 
embodying a program, applet, or instructions executable by processor 1 8 for 
performing the method steps of the present invention executable by processor 18. 
Storage system 20 may be any type of storage media such as magnetic, optical, or 
electronic storage media. Storage system 20 is illustrated in Figure 1 as a single 
device. Alternatively, storage system 20 may include a plurality of devices. 
Furthermore, each device of storage system 20 may be embodied in a different 
media type. For example, one device of storage system 20 may be a magnetic 
storage media while another device of storage system 20 is an electronic storage 
media. 

[0025] Communication interface 22 is any device or system configured to allow 
remote audio collector 4 to communicate with printer 6. Examples of communication 
interface 22 include a direct connection interface and a network interface. 
[0026] Form template 24 is any template saved in any suitable fashion for use by 
rendering subsystem 1 6 with the text data. In one embodiment, form template 24 



Case 10011774-1 



5 

includes empty fields into wliich text data is inserted before rendering the text data 
witli form template 24. 

[0027] Buffer cleaner 26 is any combination of liardware and executable code 
configured to purge text buffer 14. In one embodiment, buffer cleaner 26 is 
executable code performed by processor 18 to purging text buffer 14 after the 
rendered document is printed. 

[0028] Printer 6 includes a communication interface 44, an imaging subsystem 
46, and a processor 48. imaging subsystem 46 is any combination of hardware and 
executable code configured to print a rendered document. Processor 48 is any 
device or system configured to process executable code. Processor 48 controls 
printer 6. 

[0029] Communication interface 44 is any device or system configured to allow 
printer 6 to communicate with remote audio collector 4. Examples of communication 
interface 44 include a direct connection interface and a network interface. 
Communication interface 44 is of a type suitable for use with communication 
interface 22 of remote audio collector 4. 

[0030] Illustrated in Figure 2 is an alternate embodiment of system 2, wherein 
components of the remote audio collector illustrated in Figure 1 are embedded in 
printer 6. Components in Figure 2 identically numbered to components in Figure 1 
are identical in function to the components in Figure 1, but may differ in actual 
embodiment. 

[0031] Printer 2 includes audio digitizer 8, audio filter 10, speech recognition 
module 12, text buffer 14, rendering subsystem 16, storage system 20, imaging 
subsystem 46, and processor 48. Additionally, printer 6 optionally includes form 
template 24 and buffer cleaner 26. Audio digitizer 8, audio filter 10, speech 
recognition module 12, text buffer 14, rendering subsystem 16, storage system 20, 
imaging subsystem 46, processor 48, form template 24, and buffer cleaner 26 are as 
previously described. 

[0032] Figure 3 is a flow chart representing steps of one embodiment of the 
present invention. Although the steps represented in Figure 3 are presented in a 
specific order, the present invention encompasses variations in the order of steps. 
Furthermore, additional steps may be executed between the steps illustrated in 
Figure 3 without departing from the scope of the present invention. 
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[0033] Audio data is captured 50. In one embodiment, the audio data is captured 
by sensing the audio data and generating an analog audio data signal in response to 
the sensed audio data. The analog audio data signal is digitized into captured audio 
data. 

[0034] The captured audio data is filtered 52 to extract text data and print 
commands. In one embodiment, captured audio data is filtered 52 by comparing the 
captured audio data with text data selection criteria and print command selection 
criteria. The captured audio data meeting the print command selection criteria is 
extracted as a print command. The captured audio data meeting the text data 
selection criteria is extracted as text data. 

[0035] If a print command is not extracted from the audio data and instead text 
data is extracted from the audio data, the text data is converted 54 to electronic text. 
In one embodiment, the text data is extracted by inputting the text data into speech 
recognition module 12 and receiving electronic text from speech recognition module 
12. 

[0036] The electronic text is accumulated 56 in text buffer 14. In one 
embodiment, the electronic text is accumulated 56 in text buffer 14 by appending the 
electronic text to contents of text buffer 14. 

[0037] The process is repeated until a print command is extracted from the audio 
data. The electronic text accumulated in the text buffer is translated 58 into a 
rendered document. Translating 58 the electronic text into a rendered document 
includes translating the electronic text into a form ready for printing. In one 
embodiment, the electronic text is combined with form template 24 before translating 
58. The electronic text with form template 24 are translated into the rendered 
document. 

[0038] The rendered document is printed 60. Printing 60 the rendered document 
includes reproducing the rendered document onto print media. In one embodiment, 
after the rendered document is printed 60, the text buffer is purged 62. 
[0039] The foregoing description is only illustrative of the invention. Various 
alternatives and modifications can be devised by those skilled in the art without 
departing from the invention. Accordingly, the present invention embraces all such 
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alternatives, modifications, and variances that fall within the scope of the appended 
clainns. 
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